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This short technical manuscript contains supporting information for Ref. We consider the 
RIPE WHOIS internet data as characterized by the Cooperative Association for Internet Data 
Analysis (CAIDA) [2], and show that the Tempered Preferential Attachment (TPA) model [l| 
provides an excellent fit to this data. First we define the complementary cumulative probability 



distribution (ccdf), and then derive the ccdf for a TPA graph. Next we discuss the ccdf for the 
WHOIS data. Finally we discuss the fit provided by the TPA model and by a power law with 
q \ exponential decay (PLED). 
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> 1 I. DEFINING THE CCDF 

00 ' 

The complementary cumulative probability distribution, ccdf(x): 

C _ ) x—l oo 

ccdf (x) = 1 - J2Pj = Yl p i ■ W 

j=l j= x 
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■ A. First recall the recursion relations 



II. THE CCDF PREDICTED BY TPA WITH A l ^ A 2 



The recursion relations defining the degree distribution for TPA graphs were derived explicitly 
in Refs. and 4]. Here we derive the corresponding ccdf. These are Eqn's (16) and (17) in Qj: 



k-l\ ('{4 k 



and 



Note 



k + w 1 11 fc + w + 1 

\k=2 / \k=l 



A 2 \ „ . i-A 2 



and, for convenience, we defined: 



A 2 



(5) 



A 2 +w j 

We will first calculate the CCDF for i > A2 as we will use that result to determine the CCDF 
for i < A 2 . 



B. Calculating the CCDF, for x > A 2 



Recall the definition of the CCDF from Eqn. (pQ): 

00 

ccdf(x) = y^pj 



J=X 



3=X 

00 



3=0 



j=0 



(6) 



Since q < 1, the sum in Eqn. © is a geometric series; I 3 = V(l ~~ <?)• Thus we can write: 



ccdf(x) = (^T^) ff"""* 3 . for * > A i 



(7) 



C. Calculating the CCDF, for x < A 2 



This is slightly more complicated, as we have different functional forms for x < A 2 and x > A 2 . 



ccdf(x) = pj 



J=X 

A2-1 00 

j=x j=A 2 
A 2 -l 

Pj + ccdf (A 2 ) 

j=x 

A 2 -l , \ 

j=x 



(8) 
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Plugging in the relation for pi from Eqn. ([3]), we obtain: 



ccdf(x) =p A2 I + II k+ fc + 1 I ■ lor 1 < A ' 2 ' 

j=x k=j 



(9) 



D. Standard Normalization 



First we can check that Eqns. (|7|) and ([9]) give the same value for ccdf(^2)- They do: 

ccdf(A 2 ) = 

1-q 

And we can determine the value of pa 2 by the normalization condition that 

A 2 -lA 2 -l , 
i \ - rr k + W + 1 



ccdf(i)=i= PA2 — _+ n 

q 3=1 k=j 



In other words, 



A 2 -i A,-i 
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A; + w + 1 



j=i fc=j 



(10) 
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E. Normalizing without degree d = 1 nodes 



We may want to neglect nodes with degree d < 2 for various reasons. In that case, the normal- 
ization would be: 

A2-1A2-1 



ccdf(2)=i=^ 2 _|_+ £ n 



i=2 fc=j 



k + w + 1 
~~k 



(13) 



Thus 



PA 2 



A2-1A2-: 



-1 



i=2 fc=j 



& + + 1 



(14) 



with Eqns. (JT]) and Q unchanged (except Eqn. Q now holds for 2 < x < A2, rather than for 
1 < x < A 2 ). 
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III. THE WHOIS CCDF, FOR d > 1 
A. Whois data, renormalize to remove d < 2 



By definition: 



Thus: 



5> = L 



5^ Pi = i-pi- 

i=2 



We want to renormalize (p'- = r/pj) such that: 



3=2 3=2 



Thus 77 = 1/(1 - pa). For the Whois data, p 1 = 0.0573. and rj = 1.0608. 

The complementary cumulative distribution function (ccdf) for the renormalized prob- 
abilities: 



ccdf'(x) = ^2 Pj = V Yl p i = 71 ccdf ( x ) 



Whois original CCDF, and pi's removed 
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FIG. 1: Original CCDF of Whois data, and the renormalized CCDF'(x) = rj CCDF(x). 
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IV. FITTING TPA TO WHOIS WITH d > 2 

Whois d > 2 distribution discussed above. TPA with d > 2 is the same as with d > 1 except 
the value of pa 2 is defined as in Eqn. (fill) , in terms of d = 2 instead of (i = 1. 

Whois, d>1, and CIPA with A1=187, A2=90 
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FIG. 2: Whois CCDF for d > 2. Data points are from the Whois tables. The solid line is the fit to TPA for 
d > 2 with Ax = 187 and A 2 = 90 (and thus 7 = 1.83). With this fit, R = 0.986, thus R 2 = 0.972. 



V. FITTING PLED TO WHOIS WITH d > 2 

Assuming a PLED: p(x) = Ax~ b exp(— x/c). The normalization constant, A, is determined by 
the relation: 

00 00 
J ^2p(x) = 1 = A x ~ b exp(— x/c). 

x=1 x=2 

Then the ccdf: 

00 

ccdf (a;) = A x~ b exp(— x/c). 

j=x 
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Whois ccdf 
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FIG. 3: Whois CCDF for d>2. Data points are from the Whois tables. The solid line is the fit ccdf (a;) — 
AJ2JL X x~ h exp(-x/c), where b = 1.63 and c = 350. With this fit, R = 0.985, thus R 2 = 0.970. 
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